Computational Sociolinguistics: A Survey

Authors

  • Dong Nguyen
  • A. Seza Dogruöz
  • Carolyn Penstein Rosé
  • Franciska de Jong
Abstract

Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of "computational sociolinguistics" that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.


Similar resources

Computational Dialectology Using Glaps - Automated Processing Of Field Survey Data

Linguistic geography and sociolinguistics have been widely employed among dialectologists in postwar Japan. Over the last ten years, computer-processing of field survey data has become more and more common. The author originally developed the GLAPS processor to produce linguistic atlases by computer. GLAPS has since been modified to produce glottograms and crosstables and to handle sociolingui...


The Computational-Linguistic Approach to Forensic Authorship Attribution

This article examines the diversity of methods in authorship attribution through a lens which focuses attention on a single common element. The current state of authorship attribution study is spread throughout so many academic and non-academic disciplines that it is nigh impossible to describe all of the various assumptions about language and authorship. The disciplines involved in authorship...


Statistics in Sociolinguistics

1. Introduction. Many areas connected with sociolinguistics in which quantitative data play a role have seen the application of statistical methods, both traditional (experimental design, sampling, estimation, hypothesis testing) and heuristic (clustering, scaling). Some of these are discussed in other entries (see Social Psychology; Sociometry; Attitude Surveys: Question and Answer Process; Sc...


Social dimensions of language change

Language change results from the differential propagation of linguistic variants distributed among the linguistic repertoires of communicatively interacting individuals in a given community. From this it follows that language change is socially-mediated in two important ways. First, since language change is a social-epidemiological process that takes place by propagating some aspect of communic...


Extracting speaker-specific functional expressions from political speeches using random forests in order to investigate speakers’ individual political styles

In this study we extracted speaker-specific functional expressions from political speeches using random forests in order to investigate speakers’ individual political styles. Along with methodological development, stylistics has expanded its scope into new areas of application such as authorship profiling and sentiment analysis in addition to conventional areas such as authorship attribution an...



Journal title:
  • Computational Linguistics

Volume 42  Issue 

Pages  -

Publication date: 2016